Overview
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 1280 |
| Missing cells (%) | 0.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 MiB |
| Average record size in memory | 168.0 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 12 |
| Categorical | 7 |
| DateTime | 1 |
debt_to_income is highly overall correlated with income and 1 other fields | High correlation |
income is highly overall correlated with debt_to_income and 1 other fields | High correlation |
loan_amount is highly overall correlated with debt_to_income | High correlation |
target_default_risk is highly overall correlated with income | High correlation |
recent_default is highly imbalanced (72.6%) | Imbalance |
income has 318 (3.2%) missing values | Missing |
savings has 311 (3.1%) missing values | Missing |
monthly_expenses has 325 (3.2%) missing values | Missing |
credit_score has 326 (3.3%) missing values | Missing |
customer_id has unique values | Unique |
num_dependents has 2984 (29.8%) zeros | Zeros |
signup_dayofweek has 1454 (14.5%) zeros | Zeros |
Reproduction
| Analysis started | 2025-12-13 14:39:34.827684 |
|---|---|
| Analysis finished | 2025-12-13 14:40:02.382745 |
| Duration | 27.56 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
customer_id
Text
Unique
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 10000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | CUST006253 |
|---|---|
| 2nd row | CUST004685 |
| 3rd row | CUST001732 |
| 4th row | CUST004743 |
| 5th row | CUST004522 |
| Value | Count | Frequency (%) |
| cust005052 | 1 | < 0.1% |
| cust005312 | 1 | < 0.1% |
| cust002434 | 1 | < 0.1% |
| cust006950 | 1 | < 0.1% |
| cust000770 | 1 | < 0.1% |
| cust001686 | 1 | < 0.1% |
| cust008323 | 1 | < 0.1% |
| cust005579 | 1 | < 0.1% |
| cust004427 | 1 | < 0.1% |
| cust000467 | 1 | < 0.1% |
| Other values (9990) | 9990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 23999 | |
| C | 10000 | |
| U | 10000 | |
| S | 10000 | |
| T | 10000 | |
| 1 | 4001 | 4.0% |
| 6 | 4000 | 4.0% |
| 4 | 4000 | 4.0% |
| 2 | 4000 | 4.0% |
| 8 | 4000 | 4.0% |
| Other values (4) | 16000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 23999 | |
| C | 10000 | |
| U | 10000 | |
| S | 10000 | |
| T | 10000 | |
| 1 | 4001 | 4.0% |
| 6 | 4000 | 4.0% |
| 4 | 4000 | 4.0% |
| 2 | 4000 | 4.0% |
| 8 | 4000 | 4.0% |
| Other values (4) | 16000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 23999 | |
| C | 10000 | |
| U | 10000 | |
| S | 10000 | |
| T | 10000 | |
| 1 | 4001 | 4.0% |
| 6 | 4000 | 4.0% |
| 4 | 4000 | 4.0% |
| 2 | 4000 | 4.0% |
| 8 | 4000 | 4.0% |
| Other values (4) | 16000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 100000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 23999 | |
| C | 10000 | |
| U | 10000 | |
| S | 10000 | |
| T | 10000 | |
| 1 | 4001 | 4.0% |
| 6 | 4000 | 4.0% |
| 4 | 4000 | 4.0% |
| 2 | 4000 | 4.0% |
| 8 | 4000 | 4.0% |
| Other values (4) | 16000 |
age
Real number (ℝ)
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.8616 |
| Minimum | 18 |
|---|---|
| Maximum | 74 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 32 |
| median | 46 |
| Q3 | 60 |
| 95-th percentile | 72 |
| Maximum | 74 |
| Range | 56 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 16.457987 |
|---|---|
| Coefficient of variation (CV) | 0.35886203 |
| Kurtosis | -1.1934533 |
| Mean | 45.8616 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 0.017032931 |
| Sum | 458616 |
| Variance | 270.86533 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 54 | 209 | 2.1% |
| 73 | 209 | 2.1% |
| 53 | 199 | 2.0% |
| 36 | 194 | 1.9% |
| 34 | 192 | 1.9% |
| 48 | 191 | 1.9% |
| 46 | 190 | 1.9% |
| 25 | 189 | 1.9% |
| 21 | 187 | 1.9% |
| 58 | 186 | 1.9% |
| Other values (47) | 8054 |
| Value | Count | Frequency (%) |
| 18 | 165 | |
| 19 | 177 | |
| 20 | 186 | |
| 21 | 187 | |
| 22 | 177 | |
| 23 | 166 | |
| 24 | 174 | |
| 25 | 189 | |
| 26 | 185 | |
| 27 | 168 |
| Value | Count | Frequency (%) |
| 74 | 181 | |
| 73 | 209 | |
| 72 | 165 | |
| 71 | 172 | |
| 70 | 179 | |
| 69 | 166 | |
| 68 | 163 | |
| 67 | 159 | |
| 66 | 167 | |
| 65 | 148 |
income
Real number (ℝ)
High correlation Missing
| Distinct | 9107 |
|---|---|
| Distinct (%) | 94.1% |
| Missing | 318 |
| Missing (%) | 3.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59712.871 |
| Minimum | 20001 |
|---|---|
| Maximum | 402769 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 20001 |
|---|---|
| 5-th percentile | 22169.1 |
| Q1 | 31300.5 |
| median | 47301.5 |
| Q3 | 75164.25 |
| 95-th percentile | 140669.3 |
| Maximum | 402769 |
| Range | 382768 |
| Interquartile range (IQR) | 43863.75 |
Descriptive statistics
| Standard deviation | 39865.231 |
|---|---|
| Coefficient of variation (CV) | 0.66761538 |
| Kurtosis | 5.8091939 |
| Mean | 59712.871 |
| Median Absolute Deviation (MAD) | 19159 |
| Skewness | 1.9817283 |
| Sum | 5.7814002 × 108 |
| Variance | 1.5892367 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21232 | 4 | < 0.1% |
| 20702 | 4 | < 0.1% |
| 21078 | 3 | < 0.1% |
| 33973 | 3 | < 0.1% |
| 74178 | 3 | < 0.1% |
| 27852 | 3 | < 0.1% |
| 22316 | 3 | < 0.1% |
| 31008 | 3 | < 0.1% |
| 29224 | 3 | < 0.1% |
| 36576 | 3 | < 0.1% |
| Other values (9097) | 9650 | |
| (Missing) | 318 | 3.2% |
| Value | Count | Frequency (%) |
| 20001 | 1 | |
| 20003 | 1 | |
| 20012 | 1 | |
| 20018 | 1 | |
| 20024 | 1 | |
| 20027 | 1 | |
| 20031 | 2 | |
| 20034 | 1 | |
| 20040 | 1 | |
| 20046 | 1 |
| Value | Count | Frequency (%) |
| 402769 | 1 | |
| 382106 | 1 | |
| 381592 | 1 | |
| 370073 | 1 | |
| 323812 | 1 | |
| 307941 | 1 | |
| 302852 | 1 | |
| 293504 | 1 | |
| 289761 | 1 | |
| 289570 | 1 |
savings
Real number (ℝ)
Missing
| Distinct | 6498 |
|---|---|
| Distinct (%) | 67.1% |
| Missing | 311 |
| Missing (%) | 3.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5039.9225 |
| Minimum | 0 |
|---|---|
| Maximum | 44644 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 263.4 |
| Q1 | 1476 |
| median | 3499 |
| Q3 | 6986 |
| 95-th percentile | 15009 |
| Maximum | 44644 |
| Range | 44644 |
| Interquartile range (IQR) | 5510 |
Descriptive statistics
| Standard deviation | 5041.7936 |
|---|---|
| Coefficient of variation (CV) | 1.0003713 |
| Kurtosis | 5.8899705 |
| Mean | 5039.9225 |
| Median Absolute Deviation (MAD) | 2415 |
| Skewness | 2.0173309 |
| Sum | 48831809 |
| Variance | 25419683 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1308 | 8 | 0.1% |
| 275 | 7 | 0.1% |
| 46 | 6 | 0.1% |
| 956 | 6 | 0.1% |
| 44 | 6 | 0.1% |
| 274 | 6 | 0.1% |
| 1803 | 6 | 0.1% |
| 723 | 6 | 0.1% |
| 362 | 6 | 0.1% |
| 3252 | 6 | 0.1% |
| Other values (6488) | 9626 | |
| (Missing) | 311 | 3.1% |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 3 | |
| 2 | 1 | < 0.1% |
| 3 | 2 | |
| 4 | 3 | |
| 5 | 4 | |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 2 | |
| 10 | 2 |
| Value | Count | Frequency (%) |
| 44644 | 1 | |
| 42908 | 1 | |
| 42272 | 1 | |
| 40688 | 1 | |
| 40319 | 1 | |
| 37859 | 1 | |
| 36842 | 1 | |
| 36346 | 1 | |
| 35580 | 1 | |
| 35513 | 1 |
monthly_expenses
Real number (ℝ)
Missing
| Distinct | 3068 |
|---|---|
| Distinct (%) | 31.7% |
| Missing | 325 |
| Missing (%) | 3.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2082.2096 |
| Minimum | 200 |
|---|---|
| Maximum | 28664 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 200 |
|---|---|
| 5-th percentile | 700.7 |
| Q1 | 1471 |
| median | 2007 |
| Q3 | 2557 |
| 95-th percentile | 3355 |
| Maximum | 28664 |
| Range | 28464 |
| Interquartile range (IQR) | 1086 |
Descriptive statistics
| Standard deviation | 1385.9918 |
|---|---|
| Coefficient of variation (CV) | 0.66563509 |
| Kurtosis | 130.18133 |
| Mean | 2082.2096 |
| Median Absolute Deviation (MAD) | 543 |
| Skewness | 9.1372786 |
| Sum | 20145378 |
| Variance | 1920973.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200 | 124 | 1.2% |
| 1495 | 14 | 0.1% |
| 2064 | 13 | 0.1% |
| 1528 | 13 | 0.1% |
| 2361 | 12 | 0.1% |
| 1600 | 12 | 0.1% |
| 2568 | 11 | 0.1% |
| 1865 | 11 | 0.1% |
| 2208 | 11 | 0.1% |
| 2410 | 11 | 0.1% |
| Other values (3058) | 9443 | |
| (Missing) | 325 | 3.2% |
| Value | Count | Frequency (%) |
| 200 | 124 | |
| 202 | 1 | < 0.1% |
| 204 | 1 | < 0.1% |
| 209 | 1 | < 0.1% |
| 222 | 1 | < 0.1% |
| 223 | 1 | < 0.1% |
| 225 | 1 | < 0.1% |
| 227 | 1 | < 0.1% |
| 230 | 2 | < 0.1% |
| 232 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 28664 | 1 | |
| 27016 | 1 | |
| 25832 | 1 | |
| 25808 | 1 | |
| 25712 | 1 | |
| 25344 | 1 | |
| 24480 | 1 | |
| 23400 | 1 | |
| 22984 | 1 | |
| 21944 | 1 |
num_dependents
Real number (ℝ)
Zeros
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2142 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 2984 |
| Zeros (%) | 29.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.1089821 |
|---|---|
| Coefficient of variation (CV) | 0.91334386 |
| Kurtosis | 0.98351171 |
| Mean | 1.2142 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.94154018 |
| Sum | 12142 |
| Variance | 1.2298413 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3603 | |
| 0 | 2984 | |
| 2 | 2166 | |
| 3 | 891 | 8.9% |
| 4 | 275 | 2.8% |
| 5 | 57 | 0.6% |
| 6 | 19 | 0.2% |
| 7 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2984 | |
| 1 | 3603 | |
| 2 | 2166 | |
| 3 | 891 | 8.9% |
| 4 | 275 | 2.8% |
| 5 | 57 | 0.6% |
| 6 | 19 | 0.2% |
| 7 | 5 | 0.1% |
| Value | Count | Frequency (%) |
| 7 | 5 | 0.1% |
| 6 | 19 | 0.2% |
| 5 | 57 | 0.6% |
| 4 | 275 | 2.8% |
| 3 | 891 | 8.9% |
| 2 | 2166 | |
| 1 | 3603 | |
| 0 | 2984 |
credit_score
Real number (ℝ)
Missing
| Distinct | 9647 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 326 |
| Missing (%) | 3.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 650.15544 |
| Minimum | 363.07712 |
|---|---|
| Maximum | 850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 363.07712 |
|---|---|
| 5-th percentile | 537.24136 |
| Q1 | 602.18989 |
| median | 649.80832 |
| Q3 | 697.53743 |
| 95-th percentile | 765.93973 |
| Maximum | 850 |
| Range | 486.92288 |
| Interquartile range (IQR) | 95.347538 |
Descriptive statistics
| Standard deviation | 69.918297 |
|---|---|
| Coefficient of variation (CV) | 0.10754089 |
| Kurtosis | -0.0916615 |
| Mean | 650.15544 |
| Median Absolute Deviation (MAD) | 47.685985 |
| Skewness | 0.011038873 |
| Sum | 6289603.7 |
| Variance | 4888.5682 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 850 | 28 | 0.3% |
| 630.7657636 | 1 | < 0.1% |
| 699.8175286 | 1 | < 0.1% |
| 784.5882204 | 1 | < 0.1% |
| 642.812785 | 1 | < 0.1% |
| 711.701025 | 1 | < 0.1% |
| 631.2163997 | 1 | < 0.1% |
| 682.6370631 | 1 | < 0.1% |
| 590.8627585 | 1 | < 0.1% |
| 612.9183196 | 1 | < 0.1% |
| Other values (9637) | 9637 | |
| (Missing) | 326 | 3.3% |
| Value | Count | Frequency (%) |
| 363.0771161 | 1 | |
| 380.3538533 | 1 | |
| 411.5078207 | 1 | |
| 418.9834002 | 1 | |
| 424.3875999 | 1 | |
| 425.9140449 | 1 | |
| 426.272623 | 1 | |
| 427.0226302 | 1 | |
| 428.066598 | 1 | |
| 429.7859321 | 1 |
| Value | Count | Frequency (%) |
| 850 | 28 | |
| 849.0942247 | 1 | < 0.1% |
| 848.9532333 | 1 | < 0.1% |
| 848.8956486 | 1 | < 0.1% |
| 848.4854394 | 1 | < 0.1% |
| 847.8610444 | 1 | < 0.1% |
| 844.8516738 | 1 | < 0.1% |
| 844.5416771 | 1 | < 0.1% |
| 841.9692053 | 1 | < 0.1% |
| 840.4055592 | 1 | < 0.1% |
loan_amount
Real number (ℝ)
High correlation
| Distinct | 7999 |
|---|---|
| Distinct (%) | 80.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16214.797 |
| Minimum | 1000 |
|---|---|
| Maximum | 441190 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 1000 |
| Q1 | 8508.5 |
| median | 15174.5 |
| Q3 | 21843.75 |
| 95-th percentile | 31818.15 |
| Maximum | 441190 |
| Range | 440190 |
| Interquartile range (IQR) | 13335.25 |
Descriptive statistics
| Standard deviation | 16081.647 |
|---|---|
| Coefficient of variation (CV) | 0.99178836 |
| Kurtosis | 209.5393 |
| Mean | 16214.797 |
| Median Absolute Deviation (MAD) | 6669.5 |
| Skewness | 11.216031 |
| Sum | 1.6214797 × 108 |
| Variance | 2.5861936 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 740 | 7.4% |
| 12928 | 7 | 0.1% |
| 10000 | 6 | 0.1% |
| 21285 | 4 | < 0.1% |
| 20001 | 4 | < 0.1% |
| 14611 | 4 | < 0.1% |
| 9015 | 4 | < 0.1% |
| 18741 | 4 | < 0.1% |
| 19673 | 4 | < 0.1% |
| 22576 | 4 | < 0.1% |
| Other values (7989) | 9219 |
| Value | Count | Frequency (%) |
| 1000 | 740 | |
| 1004 | 1 | < 0.1% |
| 1005 | 1 | < 0.1% |
| 1013 | 1 | < 0.1% |
| 1021 | 1 | < 0.1% |
| 1024 | 1 | < 0.1% |
| 1029 | 1 | < 0.1% |
| 1035 | 1 | < 0.1% |
| 1049 | 1 | < 0.1% |
| 1060 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 441190 | 1 | |
| 404750 | 1 | |
| 400120 | 1 | |
| 338120 | 1 | |
| 315920 | 1 | |
| 291350 | 1 | |
| 283690 | 1 | |
| 276770 | 1 | |
| 263290 | 1 | |
| 254710 | 1 |
loan_term_months
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.642 |
| Minimum | 12 |
|---|---|
| Maximum | 72 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 36 |
| median | 48 |
| Q3 | 60 |
| 95-th percentile | 72 |
| Maximum | 72 |
| Range | 60 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 15.475134 |
|---|---|
| Coefficient of variation (CV) | 0.33905469 |
| Kurtosis | -0.44393278 |
| Mean | 45.642 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.19759142 |
| Sum | 456420 |
| Variance | 239.47978 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 48 | 2998 | |
| 36 | 2523 | |
| 60 | 2008 | |
| 72 | 1003 | 10.0% |
| 24 | 948 | 9.5% |
| 12 | 520 | 5.2% |
| Value | Count | Frequency (%) |
| 12 | 520 | 5.2% |
| 24 | 948 | 9.5% |
| 36 | 2523 | |
| 48 | 2998 | |
| 60 | 2008 | |
| 72 | 1003 | 10.0% |
| Value | Count | Frequency (%) |
| 72 | 1003 | 10.0% |
| 60 | 2008 | |
| 48 | 2998 | |
| 36 | 2523 | |
| 24 | 948 | 9.5% |
| 12 | 520 | 5.2% |
employment_years
Real number (ℝ)
| Distinct | 182 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.39701 |
| Minimum | 0 |
|---|---|
| Maximum | 21.5 |
| Zeros | 44 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.5 |
| Q1 | 2.7 |
| median | 5.1 |
| Q3 | 7.7 |
| 95-th percentile | 11.5 |
| Maximum | 21.5 |
| Range | 21.5 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.4136997 |
|---|---|
| Coefficient of variation (CV) | 0.63251683 |
| Kurtosis | -0.14701843 |
| Mean | 5.39701 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | 0.52890029 |
| Sum | 53970.1 |
| Variance | 11.653345 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.4 | 124 | 1.2% |
| 2.9 | 121 | 1.2% |
| 3.8 | 119 | 1.2% |
| 5.3 | 119 | 1.2% |
| 0.1 | 118 | 1.2% |
| 3.6 | 118 | 1.2% |
| 4.9 | 116 | 1.2% |
| 4.7 | 116 | 1.2% |
| 4.6 | 114 | 1.1% |
| 2.5 | 114 | 1.1% |
| Other values (172) | 8821 |
| Value | Count | Frequency (%) |
| 0 | 44 | 0.4% |
| 0.1 | 118 | |
| 0.2 | 101 | |
| 0.3 | 91 | |
| 0.4 | 78 | |
| 0.5 | 92 | |
| 0.6 | 83 | |
| 0.7 | 97 | |
| 0.8 | 89 | |
| 0.9 | 99 |
| Value | Count | Frequency (%) |
| 21.5 | 1 | |
| 19.5 | 1 | |
| 18.9 | 1 | |
| 18.4 | 1 | |
| 18.2 | 2 | |
| 17.9 | 1 | |
| 17.8 | 1 | |
| 17.7 | 1 | |
| 17.6 | 2 | |
| 17.5 | 1 |
home_ownership
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
| RENT | |
|---|---|
| OWN | |
| MORTGAGE | |
| OTHER | 452 |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 4.7918 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RENT |
|---|---|
| 2nd row | RENT |
| 3rd row | OWN |
| 4th row | OWN |
| 5th row | MORTGAGE |
Common Values
| Value | Count | Frequency (%) |
| RENT | 4524 | |
| OWN | 2526 | |
| MORTGAGE | 2498 | |
| OTHER | 452 | 4.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| rent | 4524 | |
| own | 2526 | |
| mortgage | 2498 | |
| other | 452 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 7474 | |
| E | 7474 | |
| T | 7474 | |
| N | 7050 | |
| O | 5476 | |
| G | 4996 | |
| W | 2526 | 5.3% |
| M | 2498 | 5.2% |
| A | 2498 | 5.2% |
| H | 452 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 47918 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 7474 | |
| E | 7474 | |
| T | 7474 | |
| N | 7050 | |
| O | 5476 | |
| G | 4996 | |
| W | 2526 | 5.3% |
| M | 2498 | 5.2% |
| A | 2498 | 5.2% |
| H | 452 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 47918 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 7474 | |
| E | 7474 | |
| T | 7474 | |
| N | 7050 | |
| O | 5476 | |
| G | 4996 | |
| W | 2526 | 5.3% |
| M | 2498 | 5.2% |
| A | 2498 | 5.2% |
| H | 452 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 47918 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 7474 | |
| E | 7474 | |
| T | 7474 | |
| N | 7050 | |
| O | 5476 | |
| G | 4996 | |
| W | 2526 | 5.3% |
| M | 2498 | 5.2% |
| A | 2498 | 5.2% |
| H | 452 | 0.9% |
education
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
| Bachelors | |
|---|---|
| HS | |
| Masters | |
| Other | |
| PhD |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.3395 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HS |
|---|---|
| 2nd row | Bachelors |
| 3rd row | Bachelors |
| 4th row | HS |
| 5th row | Masters |
Common Values
| Value | Count | Frequency (%) |
| Bachelors | 4443 | |
| HS | 2546 | |
| Masters | 1962 | |
| Other | 500 | 5.0% |
| PhD | 462 | 4.6% |
| Bachlors | 87 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| bachelors | 4443 | |
| hs | 2546 | |
| masters | 1962 | |
| other | 500 | 5.0% |
| phd | 462 | 4.6% |
| bachlors | 87 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 8454 | |
| r | 6992 | |
| e | 6905 | |
| a | 6492 | |
| h | 5492 | |
| c | 4530 | |
| l | 4530 | |
| B | 4530 | |
| o | 4530 | |
| H | 2546 | 4.0% |
| Other values (6) | 8394 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 63395 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 8454 | |
| r | 6992 | |
| e | 6905 | |
| a | 6492 | |
| h | 5492 | |
| c | 4530 | |
| l | 4530 | |
| B | 4530 | |
| o | 4530 | |
| H | 2546 | 4.0% |
| Other values (6) | 8394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 63395 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 8454 | |
| r | 6992 | |
| e | 6905 | |
| a | 6492 | |
| h | 5492 | |
| c | 4530 | |
| l | 4530 | |
| B | 4530 | |
| o | 4530 | |
| H | 2546 | 4.0% |
| Other values (6) | 8394 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 63395 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 8454 | |
| r | 6992 | |
| e | 6905 | |
| a | 6492 | |
| h | 5492 | |
| c | 4530 | |
| l | 4530 | |
| B | 4530 | |
| o | 4530 | |
| H | 2546 | 4.0% |
| Other values (6) | 8394 |
marital_status
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
| Single | |
|---|---|
| Married | |
| Divorced | |
| Widowed |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.6514 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Single |
|---|---|
| 2nd row | Married |
| 3rd row | Single |
| 4th row | Married |
| 5th row | Single |
Common Values
| Value | Count | Frequency (%) |
| Single | 4486 | |
| Married | 4002 | |
| Divorced | 1000 | 10.0% |
| Widowed | 512 | 5.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| single | 4486 | |
| married | 4002 | |
| divorced | 1000 | 10.0% |
| widowed | 512 | 5.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 10000 | |
| e | 10000 | |
| r | 9004 | |
| d | 6026 | |
| g | 4486 | |
| l | 4486 | |
| n | 4486 | |
| S | 4486 | |
| a | 4002 | |
| M | 4002 | |
| Other values (6) | 5536 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 66514 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 10000 | |
| e | 10000 | |
| r | 9004 | |
| d | 6026 | |
| g | 4486 | |
| l | 4486 | |
| n | 4486 | |
| S | 4486 | |
| a | 4002 | |
| M | 4002 | |
| Other values (6) | 5536 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 66514 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 10000 | |
| e | 10000 | |
| r | 9004 | |
| d | 6026 | |
| g | 4486 | |
| l | 4486 | |
| n | 4486 | |
| S | 4486 | |
| a | 4002 | |
| M | 4002 | |
| Other values (6) | 5536 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 66514 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 10000 | |
| e | 10000 | |
| r | 9004 | |
| d | 6026 | |
| g | 4486 | |
| l | 4486 | |
| n | 4486 | |
| S | 4486 | |
| a | 4002 | |
| M | 4002 | |
| Other values (6) | 5536 |
region
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
| East | |
|---|---|
| South | |
| North | |
| West |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.5002 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | West |
|---|---|
| 2nd row | East |
| 3rd row | East |
| 4th row | South |
| 5th row | West |
Common Values
| Value | Count | Frequency (%) |
| East | 2553 | |
| South | 2523 | |
| North | 2479 | |
| West | 2445 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| east | 2553 | |
| south | 2523 | |
| north | 2479 | |
| west | 2445 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 10000 | |
| o | 5002 | |
| h | 5002 | |
| s | 4998 | |
| E | 2553 | 5.7% |
| a | 2553 | 5.7% |
| S | 2523 | 5.6% |
| u | 2523 | 5.6% |
| N | 2479 | 5.5% |
| r | 2479 | 5.5% |
| Other values (2) | 4890 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 45002 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 10000 | |
| o | 5002 | |
| h | 5002 | |
| s | 4998 | |
| E | 2553 | 5.7% |
| a | 2553 | 5.7% |
| S | 2523 | 5.6% |
| u | 2523 | 5.6% |
| N | 2479 | 5.5% |
| r | 2479 | 5.5% |
| Other values (2) | 4890 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 45002 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 10000 | |
| o | 5002 | |
| h | 5002 | |
| s | 4998 | |
| E | 2553 | 5.7% |
| a | 2553 | 5.7% |
| S | 2523 | 5.6% |
| u | 2523 | 5.6% |
| N | 2479 | 5.5% |
| r | 2479 | 5.5% |
| Other values (2) | 4890 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 45002 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 10000 | |
| o | 5002 | |
| h | 5002 | |
| s | 4998 | |
| E | 2553 | 5.7% |
| a | 2553 | 5.7% |
| S | 2523 | 5.6% |
| u | 2523 | 5.6% |
| N | 2479 | 5.5% |
| r | 2479 | 5.5% |
| Other values (2) | 4890 |
recent_default
Categorical
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
| 0 | |
|---|---|
| 1 | 470 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 9530 | |
| 1 | 470 | 4.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 9530 | |
| 1 | 470 | 4.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9530 | |
| 1 | 470 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 9530 | |
| 1 | 470 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 9530 | |
| 1 | 470 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 9530 | |
| 1 | 470 | 4.7% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 6948 | |
| 0 | 3052 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 6948 | |
| 0 | 3052 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6948 | |
| 0 | 3052 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 6948 | |
| 0 | 3052 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 6948 | |
| 0 | 3052 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 6948 | |
| 0 | 3052 |
signup_date
Date
| Distinct | 1982 |
|---|---|
| Distinct (%) | 19.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
| Minimum | 2018-01-01 00:00:00 |
|---|---|
| Maximum | 2023-06-23 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
signup_dayofweek
Real number (ℝ)
Zeros
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0119 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 1454 |
| Zeros (%) | 14.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.003986 |
|---|---|
| Coefficient of variation (CV) | 0.6653561 |
| Kurtosis | -1.2553989 |
| Mean | 3.0119 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.017084941 |
| Sum | 30119 |
| Variance | 4.01596 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 1488 | |
| 2 | 1461 | |
| 0 | 1454 | |
| 4 | 1430 | |
| 6 | 1420 | |
| 3 | 1385 | |
| 1 | 1362 |
| Value | Count | Frequency (%) |
| 0 | 1454 | |
| 1 | 1362 | |
| 2 | 1461 | |
| 3 | 1385 | |
| 4 | 1430 | |
| 5 | 1488 | |
| 6 | 1420 |
| Value | Count | Frequency (%) |
| 6 | 1420 | |
| 5 | 1488 | |
| 4 | 1430 | |
| 3 | 1385 | |
| 2 | 1461 | |
| 1 | 1362 | |
| 0 | 1454 |
debt_to_income
Real number (ℝ)
High correlation
| Distinct | 1261 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3581564 |
| Minimum | 0.004 |
|---|---|
| Maximum | 2.031 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | 0.004 |
|---|---|
| 5-th percentile | 0.024 |
| Q1 | 0.132 |
| median | 0.275 |
| Q3 | 0.508 |
| 95-th percentile | 0.96 |
| Maximum | 2.031 |
| Range | 2.027 |
| Interquartile range (IQR) | 0.376 |
Descriptive statistics
| Standard deviation | 0.30260645 |
|---|---|
| Coefficient of variation (CV) | 0.84490031 |
| Kurtosis | 2.0629225 |
| Mean | 0.3581564 |
| Median Absolute Deviation (MAD) | 0.17 |
| Skewness | 1.3541286 |
| Sum | 3581.564 |
| Variance | 0.091570665 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.024 | 40 | 0.4% |
| 0.017 | 40 | 0.4% |
| 0.028 | 37 | 0.4% |
| 0.011 | 36 | 0.4% |
| 0.012 | 34 | 0.3% |
| 0.026 | 33 | 0.3% |
| 0.018 | 32 | 0.3% |
| 0.015 | 31 | 0.3% |
| 0.02 | 31 | 0.3% |
| 0.13 | 31 | 0.3% |
| Other values (1251) | 9655 |
| Value | Count | Frequency (%) |
| 0.004 | 5 | 0.1% |
| 0.005 | 4 | < 0.1% |
| 0.006 | 16 | |
| 0.007 | 22 | |
| 0.008 | 23 | |
| 0.009 | 22 | |
| 0.01 | 23 | |
| 0.011 | 36 | |
| 0.012 | 34 | |
| 0.013 | 27 |
| Value | Count | Frequency (%) |
| 2.031 | 1 | |
| 1.938 | 1 | |
| 1.919 | 2 | |
| 1.916 | 1 | |
| 1.866 | 1 | |
| 1.821 | 1 | |
| 1.82 | 1 | |
| 1.802 | 1 | |
| 1.782 | 1 | |
| 1.77 | 1 |
sin_age
Real number (ℝ)
| Distinct | 57 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.10038733 |
| Minimum | -0.99992326 |
|---|---|
| Maximum | 0.97384763 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 5473 |
| Negative (%) | 54.7% |
| Memory size | 78.3 KiB |
Quantile statistics
| Minimum | -0.99992326 |
|---|---|
| 5-th percentile | -0.993691 |
| Q1 | -0.7568025 |
| median | -0.15774569 |
| Q3 | 0.51550137 |
| 95-th percentile | 0.90929743 |
| Maximum | 0.97384763 |
| Range | 1.9737709 |
| Interquartile range (IQR) | 1.2723039 |
Descriptive statistics
| Standard deviation | 0.66742807 |
|---|---|
| Coefficient of variation (CV) | -6.6485288 |
| Kurtosis | -1.4372524 |
| Mean | -0.10038733 |
| Median Absolute Deviation (MAD) | 0.65185905 |
| Skewness | 0.13670034 |
| Sum | -1003.8733 |
| Variance | 0.44546023 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -0.7727644876 | 209 | 2.1% |
| 0.8504366206 | 209 | 2.1% |
| -0.8322674422 | 199 | 2.0% |
| -0.4425204433 | 194 | 1.9% |
| -0.255541102 | 192 | 1.9% |
| -0.9961646088 | 191 | 1.9% |
| -0.9936910036 | 190 | 1.9% |
| 0.5984721441 | 189 | 1.9% |
| 0.8632093666 | 187 | 1.9% |
| -0.4646021794 | 186 | 1.9% |
| Other values (47) | 8054 |
| Value | Count | Frequency (%) |
| -0.9999232576 | 180 | |
| -0.9961646088 | 191 | |
| -0.9936910036 | 190 | |
| -0.9824526126 | 185 | |
| -0.9775301177 | 170 | |
| -0.9589242747 | 177 | |
| -0.9516020739 | 147 | |
| -0.9258146823 | 154 | |
| -0.9161659367 | 164 | |
| -0.8834546557 | 164 |
| Value | Count | Frequency (%) |
| 0.9738476309 | 165 | |
| 0.9463000877 | 177 | |
| 0.9092974268 | 186 | |
| 0.8987080958 | 181 | |
| 0.8632093666 | 187 | |
| 0.8504366206 | 209 | |
| 0.8084964038 | 177 | |
| 0.7936678638 | 165 | |
| 0.7457052122 | 166 | |
| 0.7289690401 | 172 |
target_default_risk
Categorical
High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.3 KiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 5132 | |
| 0 | 4868 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 5132 | |
| 0 | 4868 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5132 | |
| 0 | 4868 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5132 | |
| 0 | 4868 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5132 | |
| 0 | 4868 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 5132 | |
| 0 | 4868 |
Interactions
Correlations
| age | credit_score | debt_to_income | education | employment_years | has_credit_card | home_ownership | income | loan_amount | loan_term_months | marital_status | monthly_expenses | num_dependents | recent_default | region | savings | signup_dayofweek | sin_age | target_default_risk | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | -0.000 | 0.022 | 0.015 | 0.005 | 0.019 | 0.000 | -0.027 | 0.012 | -0.004 | 0.000 | -0.007 | 0.006 | 0.025 | 0.000 | 0.008 | 0.005 | -0.136 | 0.024 |
| credit_score | -0.000 | 1.000 | -0.005 | 0.015 | -0.018 | 0.000 | 0.000 | 0.004 | 0.002 | 0.005 | 0.000 | -0.000 | 0.004 | 0.000 | 0.018 | 0.011 | 0.009 | -0.005 | 0.087 |
| debt_to_income | 0.022 | -0.005 | 1.000 | 0.000 | -0.015 | 0.017 | 0.000 | -0.587 | 0.751 | -0.017 | 0.010 | -0.001 | 0.001 | 0.000 | 0.007 | 0.004 | -0.017 | -0.014 | 0.485 |
| education | 0.015 | 0.015 | 0.000 | 1.000 | 0.015 | 0.000 | 0.000 | 0.003 | 0.012 | 0.010 | 0.000 | 0.019 | 0.008 | 0.000 | 0.000 | 0.000 | 0.010 | 0.000 | 0.000 |
| employment_years | 0.005 | -0.018 | -0.015 | 0.015 | 1.000 | 0.016 | 0.000 | 0.006 | -0.009 | 0.013 | 0.000 | 0.004 | -0.004 | 0.000 | 0.028 | -0.002 | 0.012 | -0.029 | 0.025 |
| has_credit_card | 0.019 | 0.000 | 0.017 | 0.000 | 0.016 | 1.000 | 0.000 | 0.019 | 0.012 | 0.000 | 0.000 | 0.032 | 0.014 | 0.000 | 0.020 | 0.000 | 0.026 | 0.006 | 0.058 |
| home_ownership | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.016 | 0.007 | 0.006 | 0.004 | 0.000 | 0.000 | 0.000 | 0.007 | 0.005 | 0.020 | 0.018 |
| income | -0.027 | 0.004 | -0.587 | 0.003 | 0.006 | 0.019 | 0.000 | 1.000 | 0.005 | 0.009 | 0.000 | -0.004 | 0.006 | 0.034 | 0.000 | 0.003 | 0.006 | 0.010 | 0.735 |
| loan_amount | 0.012 | 0.002 | 0.751 | 0.012 | -0.009 | 0.012 | 0.016 | 0.005 | 1.000 | -0.009 | 0.000 | 0.009 | 0.007 | 0.014 | 0.000 | 0.004 | -0.020 | -0.011 | 0.025 |
| loan_term_months | -0.004 | 0.005 | -0.017 | 0.010 | 0.013 | 0.000 | 0.007 | 0.009 | -0.009 | 1.000 | 0.027 | -0.009 | -0.001 | 0.020 | 0.011 | -0.002 | -0.005 | 0.002 | 0.021 |
| marital_status | 0.000 | 0.000 | 0.010 | 0.000 | 0.000 | 0.000 | 0.006 | 0.000 | 0.000 | 0.027 | 1.000 | 0.013 | 0.000 | 0.000 | 0.014 | 0.009 | 0.000 | 0.000 | 0.000 |
| monthly_expenses | -0.007 | -0.000 | -0.001 | 0.019 | 0.004 | 0.032 | 0.004 | -0.004 | 0.009 | -0.009 | 0.013 | 1.000 | 0.012 | 0.000 | 0.000 | -0.013 | 0.003 | 0.009 | 0.000 |
| num_dependents | 0.006 | 0.004 | 0.001 | 0.008 | -0.004 | 0.014 | 0.000 | 0.006 | 0.007 | -0.001 | 0.000 | 0.012 | 1.000 | 0.017 | 0.016 | -0.004 | -0.002 | -0.007 | 0.000 |
| recent_default | 0.025 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.034 | 0.014 | 0.020 | 0.000 | 0.000 | 0.017 | 1.000 | 0.000 | 0.000 | 0.000 | 0.013 | 0.000 |
| region | 0.000 | 0.018 | 0.007 | 0.000 | 0.028 | 0.020 | 0.000 | 0.000 | 0.000 | 0.011 | 0.014 | 0.000 | 0.016 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.004 |
| savings | 0.008 | 0.011 | 0.004 | 0.000 | -0.002 | 0.000 | 0.007 | 0.003 | 0.004 | -0.002 | 0.009 | -0.013 | -0.004 | 0.000 | 0.000 | 1.000 | -0.009 | -0.007 | 0.046 |
| signup_dayofweek | 0.005 | 0.009 | -0.017 | 0.010 | 0.012 | 0.026 | 0.005 | 0.006 | -0.020 | -0.005 | 0.000 | 0.003 | -0.002 | 0.000 | 0.000 | -0.009 | 1.000 | -0.004 | 0.000 |
| sin_age | -0.136 | -0.005 | -0.014 | 0.000 | -0.029 | 0.006 | 0.020 | 0.010 | -0.011 | 0.002 | 0.000 | 0.009 | -0.007 | 0.013 | 0.000 | -0.007 | -0.004 | 1.000 | 0.000 |
| target_default_risk | 0.024 | 0.087 | 0.485 | 0.000 | 0.025 | 0.058 | 0.018 | 0.735 | 0.025 | 0.021 | 0.000 | 0.000 | 0.000 | 0.000 | 0.004 | 0.046 | 0.000 | 0.000 | 1.000 |
Missing values
Sample
| customer_id | age | income | savings | monthly_expenses | num_dependents | credit_score | loan_amount | loan_term_months | employment_years | home_ownership | education | marital_status | region | recent_default | has_credit_card | signup_date | signup_dayofweek | debt_to_income | sin_age | target_default_risk | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CUST006253 | 30 | 66737.0 | 11155.0 | 2272.0 | 2 | 605.076204 | 26965.0 | 48 | 3.9 | RENT | HS | Single | West | 1 | 1 | 2020-07-05 | 6 | 0.404 | 0.141120 | 1 |
| 1 | CUST004685 | 22 | 70740.0 | 997.0 | 1934.0 | 1 | 683.291967 | 4681.0 | 36 | 0.7 | RENT | Bachelors | Married | East | 0 | 0 | 2018-10-03 | 2 | 0.066 | 0.808496 | 1 |
| 2 | CUST001732 | 68 | 38890.0 | 1929.0 | 1696.0 | 0 | 658.003360 | 12633.0 | 72 | 2.2 | OWN | Bachelors | Single | East | 0 | 1 | 2018-05-30 | 2 | 0.325 | 0.494113 | 0 |
| 3 | CUST004743 | 49 | 29049.0 | 6284.0 | 2485.0 | 1 | 707.477864 | 20881.0 | 36 | 2.7 | OWN | HS | Married | South | 0 | 1 | 2018-04-22 | 6 | 0.719 | -0.982453 | 0 |
| 4 | CUST004522 | 74 | 60063.0 | 924.0 | 3179.0 | 2 | 564.768511 | 19438.0 | 36 | 10.3 | MORTGAGE | Masters | Single | West | 0 | 0 | 2019-12-03 | 1 | 0.324 | 0.898708 | 1 |
| 5 | CUST006341 | 56 | 37852.0 | 4826.0 | 3055.0 | 3 | 686.863529 | 15328.0 | 48 | 1.3 | RENT | Masters | Single | South | 0 | 1 | 2018-11-08 | 3 | 0.405 | -0.631267 | 0 |
| 6 | CUST000577 | 19 | 64635.0 | 5240.0 | 2737.0 | 2 | 564.799942 | 23469.0 | 48 | 6.7 | RENT | HS | Divorced | North | 0 | 1 | 2021-12-14 | 1 | 0.363 | 0.946300 | 1 |
| 7 | CUST005203 | 44 | 58003.0 | 6113.0 | 1607.0 | 1 | 590.309854 | 1000.0 | 48 | 0.5 | OWN | Other | Single | East | 0 | 0 | 2019-10-02 | 2 | 0.017 | -0.951602 | 1 |
| 8 | CUST006364 | 18 | 41132.0 | 14936.0 | 1414.0 | 2 | 616.701030 | 22800.0 | 72 | 2.1 | OWN | PhD | Married | East | 0 | 1 | 2018-04-18 | 2 | 0.554 | 0.973848 | 0 |
| 9 | CUST000440 | 29 | 51038.0 | 12639.0 | 2514.0 | 1 | 604.902807 | 6050.0 | 60 | 6.0 | RENT | Masters | Single | East | 0 | 0 | 2022-02-05 | 5 | 0.119 | 0.239249 | 0 |
| customer_id | age | income | savings | monthly_expenses | num_dependents | credit_score | loan_amount | loan_term_months | employment_years | home_ownership | education | marital_status | region | recent_default | has_credit_card | signup_date | signup_dayofweek | debt_to_income | sin_age | target_default_risk | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | CUST008323 | 59 | 32742.0 | 1186.0 | 1812.0 | 1 | 631.216400 | 34003.0 | 48 | 5.0 | RENT | HS | Single | West | 0 | 0 | 2023-01-07 | 5 | 1.038 | -0.373877 | 0 |
| 9991 | CUST005579 | 25 | 46494.0 | 9050.0 | 2738.0 | 0 | 682.637063 | 20116.0 | 48 | 5.1 | RENT | Masters | Married | West | 0 | 1 | 2018-02-24 | 5 | 0.433 | 0.598472 | 1 |
| 9992 | CUST004427 | 46 | 100954.0 | NaN | 1049.0 | 2 | 590.862758 | 11164.0 | 72 | 11.7 | MORTGAGE | Bachelors | Single | South | 0 | 1 | 2020-02-26 | 2 | 0.111 | -0.993691 | 1 |
| 9993 | CUST000467 | 36 | 31124.0 | 2188.0 | 1427.0 | 1 | 612.918320 | 32198.0 | 36 | 0.3 | RENT | Bachelors | Single | East | 0 | 1 | 2023-04-23 | 6 | 1.034 | -0.442520 | 0 |
| 9994 | CUST006266 | 42 | 61252.0 | 1911.0 | 1042.0 | 1 | 774.953485 | 13243.0 | 48 | 0.9 | MORTGAGE | HS | Married | East | 0 | 0 | 2018-06-14 | 3 | 0.216 | -0.871576 | 1 |
| 9995 | CUST005735 | 54 | 44507.0 | 5975.0 | 2520.0 | 1 | 699.633352 | 31089.0 | 48 | 5.3 | RENT | HS | Single | East | 0 | 1 | 2020-02-27 | 3 | 0.699 | -0.772764 | 1 |
| 9996 | CUST005192 | 50 | 20651.0 | 10203.0 | 1020.0 | 3 | 680.774066 | 8977.0 | 60 | 9.6 | RENT | PhD | Divorced | North | 0 | 0 | 2018-08-23 | 3 | 0.435 | -0.958924 | 0 |
| 9997 | CUST005391 | 43 | 33827.0 | 3848.0 | 2562.0 | 1 | 655.562748 | 24319.0 | 60 | 4.3 | OTHER | HS | Married | West | 0 | 0 | 2019-01-18 | 4 | 0.719 | -0.916166 | 0 |
| 9998 | CUST000861 | 44 | 38273.0 | 18880.0 | 1060.0 | 2 | 653.277645 | 1000.0 | 24 | 11.4 | MORTGAGE | Other | Single | North | 0 | 1 | 2019-08-04 | 6 | 0.026 | -0.951602 | 0 |
| 9999 | CUST007271 | 30 | 53614.0 | 6201.0 | 1310.0 | 1 | 663.975556 | 5205.0 | 60 | 9.8 | RENT | Bachelors | Divorced | North | 0 | 1 | 2018-03-03 | 5 | 0.097 | 0.141120 | 1 |